MIRACLE's Hybrid Approach to Bilingual and Monolingual Information Retrieval
Abstract
The main goal of the MIRACLE team's bilingual and monolingual participation in CLEF 2004 was to test the effect of combination approaches on information retrieval. The starting point was a set of basic components: stemming, transformation, filtering, generation of n-grams, weighting and relevance feedback. Some of these basic components were applied in different combinations and in different orders for document indexing and for query processing. A second-order combination was also tested, mainly by averaging or selectively combining the documents retrieved by different approaches for a particular query.
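The second-order combination by averaging can be illustrated with a minimal sketch. The abstract does not say how scores were normalized or how documents missing from a run were handled, so this example assumes that scores are already comparable across runs and that a missing document contributes 0. The function name `average_runs` and the run names are hypothetical, not taken from the MIRACLE system.

```python
from collections import defaultdict


def average_runs(runs):
    """Average per-document scores over several runs for a single query.

    runs: list of dicts, each mapping a document id to its retrieval score.
    Assumption: a document absent from a run contributes 0 for that run.
    Returns (doc_id, averaged_score) pairs, best first.
    """
    if not runs:
        return []
    totals = defaultdict(float)
    for run in runs:
        for doc_id, score in run.items():
            totals[doc_id] += score
    averaged = {doc_id: total / len(runs) for doc_id, total in totals.items()}
    # Sort by descending score; break ties by document id for stable output.
    return sorted(averaged.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical runs for one query, e.g. one stem-based and one n-gram-based.
stem_run = {"d1": 0.8, "d2": 0.4}
ngram_run = {"d2": 0.6, "d3": 0.2}
fused = average_runs([stem_run, ngram_run])
```

In this sketch a document retrieved by both approaches (`d2`) can outrank one retrieved with a high score by only one approach (`d1`), which is the kind of effect a combination experiment would look for.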
Similar resources
MIRACLE's 2005 Approach to Cross-lingual Information Retrieval
This paper presents the MIRACLE team's 2005 approach to Bilingual and Multilingual Information Retrieval. In the multilingual track, we have concentrated our work on the merging process of the results of monolingual runs to get the multilingual overall result, relying on available translations. In the bilingual and multilingual tracks, we have used available translation resources, and in some c...
A New Hybrid Approach for Chinese-English Query Translation
A new hybrid approach combining the dictionary-based and corpus-based approaches for Chinese-English cross-language information retrieval is proposed. The bilingual dictionary provides the translation equivalents of each query term, and the word co-occurrence information trained from a monolingual corpus can be used to disambiguate the translation. Further, we investigate the roles of phrase-lev...
MIRACLE's 2005 Approach to Monolingual Information Retrieval
This paper presents the MIRACLE team's 2005 approach to Monolingual Information Retrieval. The goal for the experiments this year was twofold: to continue testing the effect of combination approaches on information retrieval tasks, and to improve our basic processing and indexing tools, adapting them to new languages with strange encoding schemes. The starting point was a set of basic components...
ISCAS at NTCIR-3: Monolingual, Bilingual and MultiLingual IR Tasks
This paper reports the methods and procedures we used in the CLIR track of NTCIR-3. For the monolingual subtasks we mainly describe the index representation and the word segmentation method. We use a hybrid model integrating an MT approach and a dictionary-based approach in the bilingual and multilingual subtasks. The method for combining the monolingual retrieval results to produce the final rank list is explained in ...
Deriving a Bilingual Lexicon for Cross Language Information Retrieval
In this paper we describe a systematic approach to derive a bilingual lexicon automatically from parallel corpora. Following this approach, a lexicon was derived from the English and Dutch versions of the Agenda corpus. With the lexicon and a part of the corpus that was not used to derive the lexicon, a bilingual retrieval environment was built. Recall and precision of monolingual Dutch retrieval wa...
Publication date: 2004